Estimating the Performance of Entity Resolution Algorithms: Lessons Learned Through PatentsView.org

Authors

Abstract

This paper introduces a novel evaluation methodology for entity resolution algorithms. It is motivated by PatentsView.org, a U.S. Patent and Trademark Office patent data exploration tool that disambiguates inventors using an entity resolution algorithm. We provide a data collection methodology and tailored performance estimators that account for sampling biases. Our approach is simple, practical, and principled, key characteristics that allow us to paint the first representative picture of PatentsView's disambiguation performance. The approach is used to inform PatentsView's users of the reliability of the data and to allow the comparison of competing disambiguation algorithms.

Similar resources

Lessons learned through leadership.

Part of the Rehabilitation and Therapy Commons This Article is brought to you for free and open access by the Jefferson Digital Commons. The Jefferson Digital Commons is a service of Thomas Jefferson University's Center for Teaching and Learning (CTL). The Commons is a showcase for Jefferson books and journals, peer-reviewed scholarly publications, unique historical collections from the Univers...

Achievements of the Cochrane Iran Associate Centre: Lessons Learned

Healthcare decision-making is a process that mainly depends on evidence and involves increasing numbers of stakeholders, including the consumers. Cochrane evidence responds to this challenge by identifying, appraising, integrating and synthesizing high-quality evidence. Recently, a collaborative effort has been initiated in Iran with Cochrane to establish a representati...

Mentoring through teamwork: lessons learned.

This essay is simply a highly personal account of how one mentor has joined with a team of mentors, combined with special "permanent" employees, lively group interactions and high expectations for trainees to provide a fertile environment for the training of scientists. I also need to acknowledge the deep personal friendships that have developed and intensified with the Rankin Lab trainees and ...

Crowdsourcing Algorithms for Entity Resolution

In this paper, we study a hybrid human-machine approach for solving the problem of Entity Resolution (ER). The goal of ER is to identify all records in a database that refer to the same underlying entity, and are therefore duplicates of each other. Our input is a graph over all the records in a database, where each edge has a probability denoting our prior belief (based on Machine Learning mode...

Facilitating Knowledge Sharing Through Lessons Learned System

Recently, many organizations realize that knowledge is a strategic tool for maintaining organizational performance. With the realization that knowledge is a core resource, organizations are now attempting to manage knowledge in a more systematic and more effective way. The theory of organizational knowledge creation suggests the sharing of tacit knowledge is a critical component of successful k...

Journal

Journal title: The American Statistician

Year: 2023

ISSN: 0003-1305, 1537-2731

DOI: https://doi.org/10.1080/00031305.2023.2191664